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ABSTRACT 

Trinucleotide repeat (TNR) expansions and deletions 
are associated with human neurodegenerative 
diseases and prostate cancer. Recent studies have 
pointed to a linkage between oxidative DNA 
damage, base excision repair (BER) and TNR expan- 
sion, which is demonstrated by the observation that 
DNA polymerase p (pol p) gap-filling synthesis acts 
in concert with alternate flap cleavage by flap endo- 
nuclease 1 (FEN1) to mediate CAG repeat expan- 
sions. In this study, we provide the first evidence 
that the repair of a DNA base lesion can also con- 
tribute to CAG repeat deletions that were initiated 
by the formation of hairpins on both the template 
and the damaged strand of a continuous run of 
(CAG) 2 o or (CAG) 2 5 repeats. Most important, we 
found that pol p not only bypassed one part of the 
large template hairpin but also managed to pass 
through almost the entire length of small hairpin. 
The unique hairpin bypass of pol p resulted in large 
and small deletions in coordination with FEN1 alter- 
nate flap cleavage. Our results provide new insight 
into the role of BER in modulating genome stability 
that is associated with human diseases. 

INTRODUCTION 

Trinucleotide repeats (TNRs) are highly polymorphic in 
the human genome and confer their instability via expan- 
sions or deletions (1,2). TNR expansions are known to be 
the cause of more than 40 neurodegenerative diseases, 
including Huntington's disease, myotonic dystrophy and 
fragile X syndrome among others (2-4). Similarly, TNR 
deletions are also closely related to human disorders. 
Epidemiological studies have demonstrated that CAG 
repeat deletions in the androgen receptor are correlated 
with human ovarian (5) and prostate cancer (6,7). 



TNR expansion has been studied extensively in bacteria 
(8), yeast (1), mouse (9), human (10,11) and human cell 
extracts (12). It was initially shown that TNR expansions 
occur during DNA replication of dividing somatic cells 
and recombination of germ cells (3,11,13,14). However, 
later studies have demonstrated that expansions can also 
be mediated by DNA repair in dividing and non-dividing 
somatic cells (3,15,16). It is proposed that the formation of 
non-B form DNA secondary structures during DNA rep- 
lication, recombination and repair contributes to expan- 
sions (11,17). In the human genome, typical non-B form 
structures arising from (CAG) W /(CTG)„, (CTG), 2 /(CAG)„, 
(CGGy(CCG)„ and (GAA)„/(TTC)„ sequence contexts 
include hairpins, tetraplexes and triplexes (11,18). These 
DNA secondary structures can cause DNA polymerase 
pausing and DNA slippage (19,20), replication fork 
stalling and collapse (14,18,21), inhibition of flap 
cleavage by flap endonuclease 1 (FEN1) (22-24), 
trapping of mismatch repair proteins (25,26) and disrup- 
tion of coordination between DNA repair proteins (27). 
All of these events can compromise cellular repair mech- 
anisms for removing the non-B form structures, thereby 
allowing them to be integrated into the human genome for 
TNR expansions. Hence, TNR expansion appears to be 
a consequence of the interactions between DNA second- 
ary structures and DNA replication, repair and recom- 
bination. Compared with the other DNA metabolic 
pathways, DNA repair plays a more versatile role in 
modulating TNR stability because it can repair the 
non-B form DNA structures generated from both DNA 
damages and other DNA metabolic pathways. 

Several DNA repair pathways have been shown to 
modulate TNR expansion during both DNA replication 
and damage repair. The repair pathways that are initiated 
during DNA replication include mismatch repair (28-32) 
and double-stranded DNA break repair, along with DNA 
recombination (1,17,33-38). A mismatch repair protein 
complex, MSH2/MSH3 can bind and stabilize hairpin 
structures, thereby promoting TNR expansion (25,26). 
Double-stranded break repair processes the intermediates 
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resulting from TNR-induced replication blockage and 
fork stalling (14,18,39), therefore facilitates large expan- 
sions. A specific DNA repair pathway can be initiated 
directly by DNA damage in the context of TNR to 
modulate repeat stability (15,27). For example, 
8-oxoguanine (8-oxoG) in TNRs initiates DNA base 
excision repair (BER) through 8-oxoG DNA glycosylase 
(OGG1), which plays an essential role in mediating 
age-dependent somatic CAG or CGG repeat expansions 
in a Huntington's disease and fragile X syndrome mouse 
models (15,40,41). The DNA glycosylase removes the 
oxidized base lesion leaving an abasic site (27), which is 
subsequently incised by apurinic/apyrimidinic (AP) endo- 
nuclease 1 (APE1). This results in single-stranded DNA 
(ssDNA) breakage, which leads to DNA slippage and the 
formation of hairpins and a multi-nucleotide gap (27). 
This further causes the disruption of functional coordin- 
ation between DNA polymerase (3 (pol (3) and FEN1 
leading to an inefficient pol (3 multi-nucleotide gap-filling 
synthesis and FEN1 alternate cleavage, i.e. the cleavage of 
a short S'-flap attached to the 5'-end of a hairpin. This 
subsequently results in production of a ligatable nick for 
ligating a hairpin with newly synthesized repeats leading 
to repeat expansion (27). Thus, inefficient BER leads to 
TNR expansion (42). A multi-nucleotide gap generated 
during BER could also allow the formation of a hairpin 
on the template strand that may mediate TNR deletion 
induced by DNA damages. 

TNR deletions can be induced in human lymphoblasts 
(43), bacterial cells (44) and mouse sperm (45), by DNA- 
damaging agents including ethyl methanesulfonate, 
mitomycin C, ethylnitrosourea, ultraviolet radiation, 
cisplatin and ionizing radiation (43,45,46). Because the 
DNA-damaging agents can induce alkylating DNA 
damage, DNA strand breaks, DNA cross-links that are 
usually subjected to BER and nucleotide excision repair, 
it suggests that BER and NER are actively involved in 
mediating TNR deletion. The transcription-coupled 
NER was found to be associated with TNR deletion in 
mammalian cells (47-49) and in Drosophila (50), thereby 
supporting this notion. NER proteins, Xeroderma 
pigmentosum group A-complementing protein (48), 
Xeroderma pigmentosum group G-complementing 
protein and Cockayne syndrome B protein are also 
known to promote TNR deletion (47,49,50). These 
findings suggest an important role of DNA damage and 
repair in modulating TNR deletion. 

Although significant progress has been made in 
exploring TNR deletion in recent years, our knowledge 
about this process remains limited. This results in poor 
understanding of the mechanisms by which deletions 
arise. Understanding the molecular basis of TNR dele- 
tions and how chemicals may induce deletions through 
DNA damage and repair is critically important for the 
development of new methods for prevention and treat- 
ment of human diseases caused by both TNR deletions 
and expansions (51). Interestingly, in addition to its role 
in promoting CAG repeat expansion (15,27), oxidative 
DNA damage was also shown to induce repeat deletions 
in bacteria (44). This suggests that oxidized base lesions 
and BER play a dual role in causing both expansions and 



deletions. This concept has been further supported by 
in vitro evidence that pol (3 slippage is involved in CAG 
repeat deletion (52). However, it remains unknown if 
or how DNA base lesions and BER cause deletions. 
To explore the potential role of BER in TNR deletions, 
we examined CAG repeat deletions during BER and 
identified the roles of major BER enzymes in the steps 
that could lead to TNR deletions. Surprisingly, we dis- 
covered that repair of a DNA base lesion in the context 
of (CAG) 2 o and (CAG) 2 5 repeats led to repeat deletions 
with the concomitant loss of 1-21 triplet repeats per repair 
event, depending upon the length of repeats. The deletions 
were mediated by pol (3 hairpin bypass in coordination 
with FEN1 alternate flap cleavage. As a result, we 
propose a hypothetical model to describe a unique mech- 
anism underlying TNR deletions involving a base 
lesion-induced loss of TNR through pol P hairpin 
bypass and FEN1 alternate flap cleavage. Our studies 
provide the first evidence that TNR deletion can be 
initiated by DNA base lesions and mediated by BER. 

MATERIALS AND METHODS 

Materials 

DNA oligonucleotides were from Integrated DNA 
Technologies (IDT Inc., Coralville, I A, USA). The 
radionucleotides [y- 32 P] ATP (6000 mCi/mmol) and 
Cordycepin S'-triphosphate 3'-[a- 32 P] (5000 mCi/mmol) 
were from PerkinElmer Inc. (Boston, MA, USA). Micro 
Bio-Spin 6 chromatography columns were purchased 
from Bio-Rad (Hercules, CA, USA). Deoxynucleoside 
S'-triphosphates (dNTPs) and terminal deoxynucleotidyl 
transferase were from Fermentas (Glen Burnie, MD, 
USA). T4 polynucleotide kinase was from USB Corp. 
(Cleveland, OH, USA). Mung Bean Nuclease was from 
Epicenter (Madison, WI, USA). The original TA cloning 
kit was from Invitrogen/Life Technologies Corp. (Grand 
Island, NY, USA). The Big Dye Terminator version 3.1 
Cycle Sequencing Kit was from Applied Biosystems/Life 
Technologies Corp (Grand Island, NY, USA). Polyclonal 
and monoclonal antibodies of FEN1 and tubulin and 
non-specific IgG were from Abeam (Boston, MA, USA). 
All standard chemical reagents were purchased from 
Sigma-Aldrich (St. Louis, MO, USA) and Thermo 
Fisher Scientific (Pittsburgh, PA, USA). Purified 
Escherichia coli uracil-DNA glycosylase (UDG), APE1, 
pol (3, FEN1, DNA ligase I (LIG I) and a monoclonal 
pol (3 antibody were generous gifts from Dr Samuel 
Wilson at the National Institute of Environmental 
Health Sciences/National Institutes of Health, Research 
Triangle Park, NC, USA. 

Oligonucleotide substrates 

Substrates containing (CAG) 2 o or (CTG) 2 o with a 
deoxyuridine or a tetrahydrofuran (THF), an abasic site 
analog, were designed to mimic a repair intermediate with 
a native or modified abasic site in the context of TNR. The 
THF residue was used to represent an oxidized abasic site 
in this study. The guanine of the first CAG or CTG of a 
substrate was substituted with a THF residue or a 
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deoxyuridine residue. A substrate with a native abasic site 
was generated by incubating an oligonucleotide contain- 
ing a deoxyuridine residue, with E. coli UDG (2.5 nM) at 
37°C for 15 min. Substrates were constructed by annealing 
the damaged strands to their template strands at a molar 
ratio of 1:1.5. Substrates with a perfect 15-nt base-paired 
hairpin or a preformed (CTG) 7 or a (CAG) 7 or a (CTG) 2 5 
hairpin in their template strands were constructed to 
mimic the repair intermediates containing a template 
hairpin with or without a S'-downstream flap. The down- 
stream primers contained a S'-phosphate-THF alone or a 
S'-phosphorylated THF with a 21-nt-dT flap or a (CAG) 7 
or a (CTG) 7 or a (CAG)i 2 or a (CAG) 2 5 flap/hairpin. 
Upstream and downstream primers were annealed to 
their templates at a molar ratio of 1:1:1.5. Substrates 
were radiolabeled at the 5'-end of the damaged strand or 
that of the upstream primer or at the 3'- or 5'-end of the 
template strand for measuring various types of enzymatic 
activities. The sequences of oligonucleotide substrates are 
listed in Supplementary Table SI. DNA size markers that 
correspond to the cleavage products of Mung Bean 
Nuclease and the products of pol (3 nucleotide insertions 
were synthesized and purified by polyacrylamide gel elec- 
trophoresis (PAGE). A DNA fragment that contained 
(CAG) 20 without any DNA base lesions was used as a 
marker for DNA fragment analysis. 

In vitro BER in mouse embryonic fibroblast cell extracts 

Cell extracts were made as described previously (53). 
Briefly, pol (3 null (pol (3 _/_ ) and wild-type pol (3 (pol 
(3 +/+ ) mouse embryonic fibroblast (MEF) cell lines were 
grown to near confluence. Cells were harvested and lysed 
and cell lysates were centrifuged at 12000rpm for 30 min 
to obtain whole-cell extracts that were subsequently 
dialyzed into BER reaction buffer containing 50 mM 
Tris-HCl, pH 7.5, 50mM KC1, 0.1 mM EDTA, 
O.lmg/ml bovine serum albumin and 0.01% Nonidet 
P-40. FEN1 deficient cell extracts were prepared by 
incubating wild-type MEF cell extracts (100 jag) with 
5 jig FEN1 polyclonal antibodies at 4°C overnight. 
Protein-antibody complexes were precipitated with 
protein A agarose beads (Pierce-Thermo Scientific, 
Rockford, IL, USA) for FEN1 depletion. Control cell 
extracts were prepared under the same conditions by 
incubating cell extracts with 5jigIgG. Levels of FEN1, 
pol (3 and tubulin (loading control) were determined by 
immunoblotting. BER was performed by incubating 60 ug 
cell extracts with the (CAG) 20 -THF or a (CAG) 20 -uracil- 
containing substrate (25 nM). A reaction (25 ul) was 
assembled in BER reaction buffer with 5mM MgCl 2 , 
2mM ATP and 50 uM dNTPs. BER reactions were also 
performed by incubating substrates with the pol (3 _/_ cell 
extracts supplied with increasing concentrations of 
purified pol (3. Substrates were pre-incubated with 50 nM 
APE1 at 37°C for 30 min to generate ssDNA break inter- 
mediates. The reaction mixture was assembled on ice and 
incubated at 37°C for 30 min. The reactions were 
terminated by transfer to 95°C for 5 min and the 
reaction mixture was subsequently subjected to protease 
K digestion at 55°C for 30 min. DNA was precipitated and 



dissolved into buffer containing 95% formamide and 
2mM EDTA. Substrates and repair products were 
separated by 15% urea-denaturing PAGE and detected 
using a Pharos FX Plus Phosphorlmager from Bio-Rad. 
Repair products were excised from gels and eluted at room 
temperature overnight with TE buffer (10 mM Tris-HCl, 
pH 7.5 and 1 mM EDTA) and then were precipitated and 
dissolved in TE buffer. Twenty nanograms of the repaired 
DNA was used for subsequent DNA fragment analysis. 
All substrates were 32 P-radiolabeled at the 5'-end of the 
strand that contained a deoxyuridine or a THF residue. 

In vitro BER reconstituted with purified enzymes 

In vitro BER of a native or modified sugar in the context 
of (CAG) 2 o repeats was performed with purified UDG, 
APE1, pol (3, FEN1, LIG I and (CAG) 20 -containing sub- 
strates with a deoxyuridine or a THF residue. The 20-ul 
reaction was reconstituted with the indicated concentra- 
tions of substrates and BER enzymes in BER reaction 
buffer that contained 5mM MgCl 2 , 2mM ATP and 
50 uM dNTPs. Reaction mixtures were assembled on ice 
and incubated at 37°C for 15 min. Reactions were then 
terminated by transfer to 95°C for 10 min in buffer con- 
taining 95% formamide and 10 mM EDTA. Substrates 
and products were separated by 15% urea-denaturing 
PAGE and detected by Phosphorlmager. Repair 
products were isolated for size analysis. Substrates were 
2 P-radiolabeled at the 5'-end of the strand with a 
deoxyuridine or a THF residue. BER mediated by pol (3 
hairpin bypass was reconstituted by incubating purified 
pol (3, FEN1 and LIG I with 25 nM substrates that 
contained a hairpin in the template strands with or 
without a downstream flap/hairpin. Reactions were per- 
formed in BER reaction buffer with 5mMMgCl 2 and 
50 uM dNTPs. To isolate repair products, the template 
or upstream strands of the substrates were biotinylated 
at the y~ or 5'-end. Repair products were incubated 
with Pierce avidin agarose beads (Pierce-Thermo 
Scientific, Rockford, IL, USA) in binding buffer that 
contained 0.1 M phosphate, 0.15MNaCl, pH 7.2 and 
1% Nondet P-40 at 4°C for 2h with rotation. The 
agarose beads were centrifuged at 5000 rpm for 1 min 
and were washed with the binding buffer three times. 
The repaired strands were dissociated and separated 
from their template strands with 0.15MNaOH and 
centrifugation at 5000 rpm for 2 min. Repaired strands 
were then precipitated and dissolved in TE buffer for sub- 
sequent amplification by polymerase chain reaction 
(PCR), cloning and DNA sequencing (Supplementary 
Figure SI). 

Enzymatic activity assays 

Pol (3 DNA synthesis was measured using substrate that 
contained (CAG) 20 repeats with a THF alone or sub- 
strates that contained a template hairpin with or without 
a downstream S'-THF-flap/hairpin. The DNA synthesis 
was measured at 37°C in a 20-ul reaction mixture that 
contained BER reaction buffer with 5mM MgCl 2 and 
50 uM dNTPs. FEN1 cleavage activity was examined in 
the same buffer with 5 mM MgCl 2 and 50 uM dNTPs in 
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the absence or presence of increasing concentrations of pol 
(3 at 37°C for 15min. Substrates and products were 
separated by 15% urea-denaturing PAGE and detected 
by a Phosphorlmager. 

Probing of hairpin structures by Mung Bean Nuclease 
digestion 

The formation of hairpin structures by CAG and CTG 
repeats was probed by incubating 2U Mung Bean 
Nuclease with 200 nM substrates that contained 
(CAG) 2 o or (CTG) 2 o repeats with a THF embedded in 
the first CAG or CTG or in a random DNA sequence. 
Hairpins present in the substrates containing a preformed 
template (CTG) 7 and (CTG) 2 5 hairpin were also 
probed by the enzyme digestion. Substrates were 
pre-incubated with 10 nM APE1 at 37°C for 30min to 
generate a ssDNA breakage intermediate. The 10-jil 
reaction mixture was assembled by incubating 2U 
of Mung Bean Nuclease with 200 nM substrates 
in reaction buffer containing 30 mM sodium acetate 
(pH 4.6), 50 mM NaCl, 1 mM zinc acetate and 0.01% 
Triton X-100. The reaction was incubated at 37°C for 1, 
2, 3, 5 and 8 min and subsequently subjected to protease K 
digestion at 55°C for 30 min to remove Mung 
Bean Nuclease. Substrates and products were separated 
by 15% urea-denaturing PAGE and detected by a 
Phosphorlmager. 

Sizing of repair products with CAG repeats by DNA 
fragment analysis 

Repair products (20 ng) resulting from BER in the context 
of (CAG) 2 o repeats were amplified by PCR with a forward 
primer tagged by a 6-carboxyfluorescein and an untagged 
reverse primer that were annealed to the 5'- and 3'-end of 
random sequence regions of repair products. PCR ampli- 
fication was performed under the following conditions: 
95°C for 10 min, 1 cycle; 95°C for 30 s, 50°C for 30 s and 
72°C for 1.5min, 35 cycles; 72°C for 1 h, 1 cycle. The 
6-carboxyfluorescein-labeled PCR products were then 
subjected to capillary electrophoresis. The size of repair 
products was determined by DNA fragment analysis 
(Florida International University DNA Sequencing Core 
Facility) with Peak Scanner version 1.0 software (Applied 
Biosystems, Foster City, CA, USA). 

Cloning and sequencing of repair products 

The repair products resulting from pol (3 hairpin bypass 
were amplified by PCR under the following conditions: 
95°C for 5 min, 1 cycle; 95°C for 1 min, 54°C for 30 s 
and 72°C for 1 min, 25 cycles; 72°C for 10 min, 1 cycle. 
PCR products were then ligated into the pCR 2.1-TOPO 
vector (Invitrogen/Life Technologies Corp., Grand 
Island, NY, USA). Recombinant plasmids that contained 
repair products were then transformed into SURE 2 
Supercompetent Cells (Stratagene, Santa Clara, CA, 
USA). The plasmids were isolated and purified by the 
Miniprep Kit from Qiagen (Valencia, CA, USA) and 
were subjected to DNA sequencing. PCR sequencing re- 
actions were performed with the Big Dye Terminator 
version 3.1 Cycle Sequencing Kit from Applied 



Biosystems/Life Technologies under the following condi- 
tions: 94°C for 15 s, 50°C for 10 s and 60°C for 4 min, 
35 cycles. The sequences of repair products were 
determined using ABI Genetic Analyzer 3130 version 1 
(Applied Biosystems/Life Technologies Corp) by the 
Florida International University DNA Sequencing Core 
Facility and analyzed with Mac Vector version 12.0.2 
software (Mac Vector Inc., Cary, NC, USA). 

RESULTS 

Pol p promotes CAG repeat deletion during BER 

Pol (3, a central component of BER, removes a native de- 
oxyribose phosphate residue and fills in a single-nucleotide 
gap to fulfill the efficient single nucleotide BER (42). Pol (3 
can also perform multi-nucleotide gap-filling synthesis and 
strand displacement synthesis that leads to the relatively 
inefficient long-patch BER (42). Pol (3 multi-nucleotide 
gap-filling synthesis was found to promote CAG repeat 
expansion by producing extra repeat units (27), whereas 
its slippage synthesis is suggested to be involved in repeat 
deletion. Thus the enzyme plays a pivotal role in 
modulating TNR instability. To further determine how 
pol (3 could be involved in TNR deletion, we initially 
studied deletions in the cell extract-based BER of a 
THF embedded at the first CAG of a (CAG) 20 repeat- 
containing substrate. Experiments were performed using 
pol (3 _/_ (Figure 1A, panel b), pol (3 +/+ cell extracts 
(Figure 1A, panel c) or pol (3 _/_ cell extracts comple- 
mented with lOnM purified pol (3 (Figure 1A, panel d). 
The results revealed that significant amounts of deletion 
products formed with pol (3 +/+ cell extracts and pol (3 _/_ 
cell extracts complemented with purified pol (3 (Figure 1 A, 
panels c and d). However, only a small amount of deletion 
product was produced by pol (3 _/_ cell extracts (Figure 
1A, panel b). The amounts of deletion products increased 
by 3- to 4-fold by the presence of pol (3 (Figure 1 A, panel 
e). The repair mainly resulted in (CAG)i 9 - and (CAG)i 8 - 
repeat containing products with one or two repeat deletion 
(Figure 1 A, panels c and d). To further confirm the role of 
pol P in CAG repeat deletion, we determined the deletions 
in a reconstituted BER by incubating the (CAG) 2 o-con- 
taining substrate with purified APE1, FEN1, LIG I and 
increasing concentrations of pol (3. Consistent with the 
results from the pol (3 cell extracts, the reconstituted 
BER also resulted in deletion of one or two CAG 
repeats with the amounts of the products slightly increased 
with increasing concentrations of pol (3 (Figure IB, panels 
b, c and d). Thus, the results indicate that pol P promotes 
CAG repeat deletion. To exclude the possibility that de- 
letion products could result from PCR artifacts due to 
Taq polymerase DNA slippage, a (CAG) 2 o repeat- 
containing marker without any base lesions was 
PCR-amplified and subjected to sizing analysis under the 
same conditions as the ones used for repair products. 
The results showed that little deletion product was from 
the amplified marker (Figure 1A and B, panel a), 
indicating that repeat deletions were from BER rather 
than from PCR artifacts. 
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Figure 1. CAG repeat deletion during BER. (A) Repair of an abasic 
site in the context of CAG repeats was examined with a (CAG) 2 o- 
containing substrate with a THF embedded in the first CAG. BER 
reactions were reconstituted with cell extracts from pol (3 +/+ , pol P _/ ~ 
MEFs and pol P _/ ~ MEF cell extracts complemented with 10 nM of 
purified wild-type pol p. Gel-isolated repair products were amplified by 
PCR with 6-carboxyfluorescein-labeled primers. PCR products were 
separated by capillary electrophoresis and their size was determined 
by DNA fragment analysis. The products are illustrated as different 
individual peaks. The height of a peak represents the abundance of 



Formation of CAG and CTG repeat hairpin structures 
during BER 

It is proposed that TNR deletion involves the formation 
of hairpin structures on the template DNA strand and 
hairpin bypass by DNA polymerases during DNA repli- 
cation (11,54). Thus, it is conceivable that hairpins may 
also form on the template strands during BER leading to 
pol (3 hairpin bypass DNA synthesis. To test this possibil- 
ity, we initially determined the formation of hairpin struc- 
tures in the damaged and template strands of the 
substrates containing (CAG) 2 o or (CTG) 2 o with a THF 
residue, using Mung Bean Nuclease. The nuclease prefer- 
entially makes cleavages at a single-strand loop region and 
the sites with mismatched base pairs in the stem region of 
a hairpin. A specific cleavage pattern of Mung Bean 
Nuclease on CAG or CTG repeats indicates the formation 
of a hairpin with a specific size. The enzyme cleavage on 
the template strand of the (CAG) 2 o-repeat substrate 
resulted in products with 20, 22, 27, 31, 34 and 37 nt, re- 
spectively (Figure 2A, left panel). The cleavage pattern 
indicated that a (CTG) 7 hairpin formed next to the 
5'-side 20 nt-random sequence of the template strand. 
The hairpin consisted of a one-CTG loop and a 
six-CTG repeat stem as indicated (Figure 2A). The 
enzyme cleavage on the damaged strand mainly generated 
products with 25, 29, 32, 34 and 36 nt, respectively 
(Figure 2A, right panel), indicating the formation of a 
(CAG) 7 repeat hairpin next to the 3'-side 20 nt-random 
sequence of the damaged strand. The hairpin was 
composed of a (CAG) 3 -repeat loop and a stem with 
(CAG) 4 repeats (Figure 2A). The nuclease cleavage on 
the template strand resulted in products with 25, 29, 32, 
34, 37 and 40 nt (Figure 2B), respectively. The cleavage 
pattern demonstrated the formation of a (CAG) 8 repeat 
hairpin next to the 5'-side 20-nt random sequence of the 
template strand. The hairpin contained a (CAG) 4 -repeat 
loop region and a stem region with (CAG) 4 repeats 
(Figure 2B). The nuclease digestion of a substrate con- 
taining random sequence failed to generate any cleavage 
products (Supplementary Figure S2A). The results 
indicated that the formation of hairpins on both the 
damaged and template strands during BER is repeat 
sequence-dependent. 

Pol p bypasses a template hairpin effectively, thereby 
facilitating CAG repeat deletions during BER 

Since TNR deletion is proposed to be mediated by DNA 
polymerase bypass of a template hairpin generated during 
DNA replication (54), a similar mechanism could also 
be adopted by pol (3 to mediate TNR deletion during 
BER. To test this hypothesis, pol (3 DNA synthesis was 



Figure 1. Continued 

repair products. The sizes of repair products are illustrated in nucleo- 
tides and in number of repeat units. (B) BER reactions were 
reconstituted by incubating 25 nM substrate radiolabeled at the 5'-end 
of its damaged strand with pol P (2.5 and 5nM), 50nM APE1, lOnM 
FEN1 and 5nM LIG I along with 50jiM dNTPs under the conditions 
described in the 'In vitro BER in mouse embryonic fibroblast cell 
extracts' of 'Experimental Procedures'. 
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Figure 2. Formation of trinucleotide repeat hairpins during BER. (A) Formation of a template hairpin on the (CAG) 2 o repeat substrate with a THF 
residue was probed by Mung Bean Nuclease digestion (left panel). The substrate was radiolabeled at the 5 ; -end of its template strand and was precut 
by 10 nM APE1. Subsequently, the substrate was incubated with 2U of Mung Bean Nuclease at 1-, 2-, 3-, 5-, 8-, 10- and 15-min time intervals (lanes 

2- 8). Lane 1 represents the undigested substrate. Lane 9 represents synthesized size markers (M) with 18, 29, 38, 56 and 74 nucleotides, respectively. 
Hairpins formed on the damaged strand were probed by the enzyme digestion of the substrate radiolabeled at the 3 7 -end of the strand (right panel, 
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A hairpin deduced by a specific nuclease cleavage pattern and the nuclease digestion sites is illustrated schematically beside the gel. 
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examined with substrates containing a template hairpin 
consisting of a 6-nt loop and a 15-nt perfect base-paired 
stem (Figure 3A and B, lanes 2—4) or a template hairpin 
with (CAG) 7 (Figure 3A and B, lanes 6-8) or (CTG) 7 
(Figure 3 A and B, lanes 11-13) or (CTG) 25 (Figure 3 A 
and B, lanes 16—18). These substrates mimic the BER 
intermediates containing a small or large template 
hairpin with or without a S'-downstream flap that result 
from APE1 S'-incision and DNA slippage. The formation 
of the hairpins on both the template and damaged strands 
of these substrates was verified by Mung Bean Nuclease 
digestion (Supplementary Figure S2B-S2D). Substrates 
were radiolabeled at the 5'-end of the upstream primers 
for characterizing any effects of downstream flaps/ 
hairpins on pol (3 primer extension for its hairpin 
bypass. With all the substrates, pol (3 exhibited efficient 
multi-nucleotide synthesis (illustrated schematically by red 
lines) indicating that it readily bypassed the template 
hairpins with varying sizes (Figure 3A and B, lanes 2, 6, 
11 and 16). For both (CAG) 7 and (CTG) 7 template 
hairpins, pol (3 inserted five to seven repeat units (Figure 
3 A, lanes 6, 7, 11 and 12), indicating that the enzyme 
passed through almost the entire length of the hairpins. 
In contrast, pol (3 only inserted 3-10 repeats in bypassing 
(CTG) 25 hairpin (Figure 3 A and B, lanes 16 and 17), sug- 
gesting that the enzyme can skip over the hairpin by 
passing through its portion. Interestingly, the presence 
of FEN1 did not affect pol (3 DNA synthesis on the sub- 
strates without a downstream flap, but inhibited the DNA 
synthesis on the substrates with a S'-flap (compare lanes 2 
and 3, 6 and 7, 11 and 12, and 16 and 17 of Figure 3B). 
These results suggested that FEN1 suppressed pol (3 
hairpin bypass synthesis, possibly by competing with the 
polymerase to bind a flap. To determine whether pol (3 
hairpin bypass could result in repeat deletions, BER reac- 
tions were reconstituted using the same set of template 
hairpin-containing substrates and purified pol P, FEN1 
and LIG I. The results showed that pol (3 bypass of the 
perfect base-paired hairpin and the (CTG) 2 5 hairpin led to 
repair products with deletions of 4-39 and 3-63 nt (1-21 
repeats), respectively, which were identified by DNA 
sequencing (Figure 6 A and B). Consistent with the obser- 
vation, pol (3 bypass of a (CAG) 2 5 hairpin also inserted 1- 
6 repeats and resulted in 3-60 nt (1-20 repeats) 
(Supplementary Figure S3). Surprisingly, pol (3 bypass 
of (CTG) 7 and (CAG) 7 hairpins failed to cause any 
repeat deletion (Figure 6A). This indicated that it 
passed through only a portion of the large hairpins 
and the entire small hairpins. Thus, pol (3 can bypass a 
large hairpin directly leading to repeat unit loss, but can 
faithfully copy almost all the repeats that constitute a 
small hairpin. However, other repair steps that can 
shorten the repeats may also contribute to repeat 
deletion. During BER, this could result from FEN1 flap 
cleavage activity. 

FEN1 promotes CAG repeat deletion during BER 

FEN1 has been found to play controversial roles in 
modulating TNR stability in replication and BER. 
FEN1 cleavage on a S'-TNR-flap during DNA replication 



is crucial for preventing TNR expansion (24,55,56). 
However, FEN1 alternate cleavage of a S'-short flap 
attached to a hairpin during long-patch BER, promotes 
CAG repeat expansion (27). FEN1 flap cleavage may also 
be involved in repeat deletion during BER by shortening 
repeat length. To test this possibility, we initially examined 
FEN1 cleavage during BER of a THF embedded in 
(CAG) 20 repeats. We found that with increasing concen- 
trations of pol (3 (2.5, 5 and lOnM), FEN1 (5, 10 and 
25 nM) cleaved up to four CAG repeats in the presence 
of pol (3 (Figure 4A, lanes 4—12). Pol P alone failed to give 
any cleavage products (Figure 4A, lane 3) indicating the 
products were FEN1 specific. To further determine if the 
cleavage products may result from FEN1 cleavage in 
the presence of a template hairpin, we examined its 
cleavage on a 5 / -THF-(CAG) 7 or 5 / -THF-(CTG) 7 or 
5 / -THF-(CAG) 25 or 5 / -THF-(CTG)25 flap/hairpin in the 
presence of a template (CAG) 7 or (CTG) 7 or (CTG) 25 or 
(CAG) 25 hairpin (Figure 4B and Supplementary Figure 
S3C). We found that FEN1 removed up to five or six 
CAG or CTG repeats from the (CAG) 7 or (CTG) 7 flap/ 
hairpin in the presence of pol (3 (5— 25 nM) (Figure 4B, 
lanes 1—6) which was similar to its cleavage during 
BER. Interestingly, FEN1 removed up to seven to eight 
repeats from the (CAG) 25 or (CTG) 25 flap/hairpin (Figure 
4B, lanes 7—9 and Supplementary Figure S3C) suggesting 
that a large hairpin allowed FEN1 to remove more repeats 
for larger deletion. To further determine if FEN1 cleavage 
is essential for repeat deletion, we examined repeat 
deletion in the absence and presence of FEN1 with a sub- 
strate containing a deoxyuridine that substituted the first 
guanine of (CAG) 20 repeats (Figure 5). We reasoned that 
removal of the base lesion by UDG would create an abasic 
site that results in a native 5'-sugar phosphate and a 
single-nucleotide gap, and these can be directly removed 
by pol P deoxyribose phosphate lyase and filled by pol P 
gap-filling synthesis without the need of FEN1 cleavage. 
Thus, no deletion products should be detected because the 
repeat length cannot be shortened by pol p. However, if 
FEN1 is included in the repair, it could remove short 
CAG repeats from the 5'-end of a downstream hairpin, 
leading to shortening of repeats and deletions. In support 
of this idea, we identified a small amount of deletion 
product in the absence of FEN1 (Figure 5, panel b), but 
significant amounts of deletion products in the presence of 
10 nM FEN1 (Figure 5, panel c). Quantitative analysis 
demonstrated that FEN1 increased the deletion by 
~3-fold (Figure 5, panel d). In contrast, reduction of 
FEN1 level in pol P +/+ MEF cell extracts by -5-fold 
through immuno-depletion (Supplementary Figure S4) 
decreased the amount of deletion products only by 
~2-fold (Figure 5, panels f-i). The immuno-depletion of 
FEN1 did not affect pol P level in the cell extracts 
(Supplementary Figure S4) indicating that reduction of 
deletion products was specifically from FEN1 deficiency. 
Thus, FEN1 is not essential for, but promotes TNR 
deletion. Our results indicate that FEN1 plays an im- 
portant role in facilitating CAG repeat deletion by 
cooperating with pol P hairpin bypass synthesis during 
long-patch BER. 
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Figure 3. Hairpin bypass synthesis of pol |3. (A) Pol P hairpin bypass synthesis and BER were examined in the context of a template (CAG) 7 or a 
(CTG) 7 hairpin or a (CTG) 2 5 hairpin or a template hairpin with a 6-nt loop and 15-nt perfect base pairs at its stem region. The substrates contained 
THF residues at the 5'-end of the downstream primers and were radiolabeled at the 5'-end of the upstream primers. BER reactions were reconstituted 
with 10 nM pol p, 10 or 25 nM FEN1 and 5nM LIG I in the presence of 50 uM dNTPs under the conditions described under 'Experimental 
Procedures' (lanes 4, 8, 13 and 18). Lanes 1, 5, 10 and 15 represent substrates only. Lanes 2, 6, 11 and 16 correspond to the reaction mixture with 
10 nM pol p. Lanes 3, 7, 12 and 17 represent the reaction mixture in the presence of 10 nM pol P and 10 or 25 nM FEN1. Lanes 9, 14 and 19 
represent size markers of pol P DNA synthesis. In all the reactions, 25 nM DNA substrates were used. (B) Pol P DNA synthesis and repair products 
were determined with substrates that contained a downstream 5 7 -THF containing (CTG) 7 or (CAG) 7 or (CAG) 2 5 flap/hairpin or 15 dT flap with a 
template hairpin. Lanes 1, 5, 10 and 15 represent substrates only. Lanes 2, 6, 11 and 16 correspond to the reaction mixtures with 10 nM pol p. Lanes 
3, 7, 12 and 17 represent the reaction mixture in the presence of 10 nM pol P and 10 or 25 nM FEN1. Lanes 4, 8, 13 and 18 represent the reaction 
mixture reconstituted with 10 nM pol p, 10 or 25 nM FEN1 and 5nM LIG I. Lanes 9, 14 and 19 represent size markers of pol P DNA synthesis. In 
all the reactions, 25 nM S'-radiolabeled substrates were used. The substrates are illustrated schematically above the gel. Pol P DNA synthesis 
products are illustrated by dotted lines. 
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DISCUSSION 

In this study, for the first time, we discovered that CAG 
repeats can be deleted during BER (Figures 1, 5 and 6). 
We provided direct evidence that repeat deletions are 
generated in both cell extract-based and reconstituted 
BER (Figure 1) and that this process is accomplished by 
pol (3 hairpin bypass synthesis in coordination with FEN1 
alternate cleavage. We demonstrated that during BER, 



hairpin structures are generated on the damaged and 
template strands of (CAG) 2 o repeats (Figure 2). These 
hairpins govern TNR deletion by cooperatively interact- 
ing with pol (3 hairpin bypass and FEN1 alternate flap 
cleavage. We discovered that pol P readily bypasses a 
small portion of a large template hairpin, directly resulting 
in loss of large size CAG repeats (Figure 3 A and B, lanes 
16 and 17; Figure 6); however, pol (3 can pass through 
almost the entire length of a small template hairpin 
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causing the loss of only one or two repeats (Figure 3A and 
B, lanes 7, 8, 1 1 and 12; Figure 6). With the unique hairpin 
bypass of pol (3, repeat deletions will have to be ultimately 
fulfilled through FEN1 cleavage. In supporting this 
notion, we found that FEN1 alternate flap cleavage 
removed up to four CAG repeats in the presence of a 
small template hairpin (Figure 4A), while pol (3 hairpin 
bypass only inserted three or four repeats in the 
presence of FEN1 (Figure 3B). Thus, FEN1 removes 
more repeats than those inserted by pol (3 leaving a gap 
that led to the formation of a loop containing one or two 
repeats on the template strand for small deletions. 
Consistent with this idea, we observed that FEN1 effect- 
ively cleaved six to seven repeat units from a pre-formed 
(CTG) 7 or (CAG) 7 flap/hairpins (Figure 4B) and signifi- 
cantly stimulated small repeat deletions (Figure 5). With 
longer repeats [i.e. (CAG) 2 5 or (CTG) 2 s] that can form a 
large template hairpin, pol (3 skips over the hairpin by 
passing through a small portion of the hairpin, This 
allows it to perform a strand-displacement synthesis to 
displace a longer repeat flap for FEN1 cleavage, thereby 
causing larger sizes of deletions (Figures 3B, 4B and 6). 
Our data support a model for TNR deletion during BER 



of oxidative DNA base lesions where ssDNA breakage is 
induced by OGG1 and APE1 (Figure 7). This in turn leads 
to DNA slippage and the formation of a S'-downstream 
hairpin, template hairpin and multi-nucleotide gap. For 
short repeats, a small template hairpin allows pol (3 to 
pass through the large portion of the hairpin. FEN1 
removes a short S'-flap associated with a small down- 
stream hairpin, leaving a gap. This further allows the for- 
mation of a loop with one or two repeats on the template 
leading to a ligatable nick, which upon nick sealing affords 
a repeat deletion (Figure 7, sub-pathway on the left). For a 
long repeat tract, ssDNA breakage leads to the formation 
of a large template hairpin that allows pol [3 to bypass only 
one small portion of the hairpin. The polymerase then 
skips over the hairpin to perform a strand displacement 
synthesis generating a long repeat flap that is removed by 
FEN1 leading to a large repeat deletion (Figure 7, right 
sub-pathway). 

In this study, we also provide the first evidence that 
CAG repeat deletion during BER exhibits a length de- 
pendency with a relatively large deletion induced by long 
repeats that is similar to its expansion (1). This suggests 
that TNR deletion and expansion share the same 
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underlying molecular basis, i.e. the formation of hairpin 
structures. Our results indicate that the formation of a 
series of hairpin structures with varying sizes and stability 
dependent upon their length appear to govern TNR 
deletion. Thus, it is of interest to understand the length 
dependency of TNR instability mediated by BER, as this 
will further provide a basis for therapeutic purposes of 
TNR-related human diseases by manipulating the length 
of TNR through DNA repair. 

Our observation of pol (3 passing through one part or an 
entire length of a template hairpin suggests that the poly- 
merase can use its strand-displacement synthesis to pass 
through a hairpin. In this manner, pol (3 may destabilize 
hairpin structures, thereby preventing repeat deletion. 
It is possible that BER cofactors and the proteins from 
other repair pathways that can modulate pol (3 strand- 
displacement synthesis may help prevent or facilitate 
TNR deletion. For example, BER cofactor HMGB1 (57) 
that stimulates pol (3 strand-displacement synthesis may 
inhibit TNR deletion. However, proteins that can bind 
and stabilize hairpin structures such as MSH2/MSH3 
(25,26) may inhibit pol (3 strand-displacement within a 
template hairpin and promote pol (3 skip-over of the 
hairpin facilitating repeat deletion. In contrast, DNA 
helicases that can destabilize hairpins, such as Bloom 



syndrome protein and Werner syndrome protein (58-60), 
may help pol (3 to pass through the entire length of a 
hairpin, thus preventing repeat deletion. The mechanisms 
by which BER proteins modulate TNR deletion through 
cooperating with the proteins from other repair pathways 
need to be further elucidated. 

Cellular oxidative stress may induce single or multiple 
DNA base lesions at other locations, such as in the middle 
of repeat tracks or at the 3'-side of repeats, the locations of 
base lesions could result in the formation of different sizes 
of hairpins at specific sites of repeat tracks. This may 
modulate pol (3 hairpin bypass and FEN1 cleavage with 
different efficiency. It is also conceivable that multiple 
rounds of base lesion repair in the context of TNR could 
lead to the formation and accumulation of multiple 
template hairpins that could be processed by BER 
enzymes in a manner different from the one used for pro- 
cessing a single hairpin, resulting in large size deletion. This 
is implicated by a recent report showing that an abasic 
lesion located at various locations of CAG/CTG repeats 
is repaired by BER with different efficiency (61). The 
effects of the numbers and positions of DNA base lesions 
on TNR instability need to be further investigated. 

Our study reveals that pol (3 hairpin bypass synthesis 
cooperates with FEN1 cleavage of a S'-hairpin causing 
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deletion of CAG repeats. Our previous work 
demonstrated that a robust pol (3 synthesis generates 
extra CAG repeats that can be ligated with a 5'-down- 
stream hairpin shortened partially by FEN1 alternate 
flap cleavage promoting CAG repeat expansion (27). 
These findings suggest that repeat number synthesized 
by pol P and that cleaved by FEN1 governs the 
outcome of TNR instability as to expansion or deletion 
during BER. We suggest that expansion will occur if pol (3 
synthesizes more repeats than those cleaved by FEN1, 
whereas deletion will occur if FEN1 removes more 
repeats than pol (3 synthesizes. 

In summary, in this study, we identified an important 
role of oxidative DNA damage and BER in CAG repeat 
deletion for the first time. We provide the first evidence 
that a base lesion-induced ssDNA breakage can lead to 
the formation of a hairpin in both the template and 
damaged strands of TNR tracks and that pol (3 employs 
its strand-displacement synthesis to pass through a 
portion or the entire length of the template hairpin 
based upon the size of hairpin structures. In contrast, 
FEN1 uses its alternate cleavage to process the hairpin 
on the damaged strand. Our results reveal that the cooper- 
ation between pol (3 hairpin bypass and FEN1 alternate 
cleavage results in small and large deletions. Our work 
further suggests that TNR deletion during DNA base 
damage and repair is governed by the interactions 
between TNR hairpins, pol (3 unique hairpin bypass and 
FEN1 alternate cleavage. 
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